RAND 


The  Measurement  and 
Interpretation  of  Health  in 
Social  Surveys 


Duncan  Thomas 
Elizabeth  Frankenberg 


DRU-2550-NIA 


December  2000 


Labor  and  Population  Program 

Working  Paper  Series  01-06 

The  RAND  unrestricted  draft  series  is  intended  to  transmit 
preliminary  results  of  RAND  research.  Unrestricted  drafts 
have  not  been  formally  reviewed  or  edited.  The  views  and 
conclusions  expressed  are  tentative.  A  draft  should  not  be 
cited  or  quoted  without  permission  of  the  author,  unless  the 
preface  grants  such  permission. 

RAND  is  a  nonprofit  institution  that  helps  improve  policy  and  decisionmaking  through  research  and  analysis. 
RAND's  publications  and  drafts  do  not  necessarily  reflect  the  opinions  or  policies  of  its  research  sponsors. 


The  measurement  and  interpretation  of  health  in  social  surveys 


Duncan  Thomas 
RAND  &  UCLA 

and 

Elizabeth  Frankenberg 
RAND 


December,  2000 


Paper  prepared  for  World  Health  Organization  Conference  on  Summary  Measures  for  Population  Health,  Marrakech,  December, 
1999.  Comments  of  Ritu  Sadana  have  been  very  helpful.  Financial  support  from  the  National  Institute  on  Aging  (NIA  P01- 
AG08291),  the  National  Institute  of  Child  Health  and  Human  Development  (NICHD  P01-HD28372)  and  the  National  Science 
Foundation  (SBR-95 12670)  is  gratefully  acknowledged. 


1.  Introduction 


Health  status  is  hard  to  measure.  It  is  widely  recognized  that  health  is  multi-dimensional  reflecting 
the  combination  of  an  array  of  factors  that  include  physical,  mental  and  social  well-being,  genotype  and 
phenotype  influences  as  well  as  expectations  and  information.  A  multitude  of  health  indicators  have  been 
used  in  scientific  studies  drawing  on  data  from  both  the  developed  and  developing  world.  Understanding 
what  those  indicators  measure  is  central  if  the  results  reported  in  the  studies  are  to  be  interpreted  in  a 
meaningful  way.  Whether  one  is  interested  in  summarizing  the  health  of  a  population  or  understanding 
the  links  between  health  and  other  measures  of  well-being  at  the  individual  level,  poor  measurement  will 
likely  yield  poor  inferences. 

There  is  a  large  literature  that  discusses  the  validity  and  limitations  of  different  health  measures. 
Murray  and  Chen  (1992)  and  Sadana  (2001)  provide  excellent  reviews  and  discussion.  Some  of  the  most 
insightful  empirical  studies  have  compared  indicators  of  specific  morbidities  reported  by  respondents  in 
health  interview  surveys  with  indicators  based  on  health  examinations  of  the  same  individuals  conducted 
by  trained  health  workers.  Other  very  influential  studies  have  examined  the  extent  to  which  self-reported 
health  predicts  health  problems  later  in  life.  A  third  class  of  studies  contrasts  prevalence  rates  based  on 
health  interviews  with  other  sources. 

This  paper  attempts  to  provide  insights  into  the  meaning  of  a  set  of  relatively  general  health 
indicators  that  are  commonly  collected  in  health  interview  surveys.  These  self-reports  are  contrasted  with 
a  battery  of  physical  assessments  conducted  on  the  same  respondents  in  the  same  survey,  usually  a  day 
or  two  after  the  interview. 

The  key  complexity  that  arises  from  these  comparisons  is  that  "true"  health  status  is  seldom  known. 
To  side-step  this  complexity,  we  begin  with  indicators  of  health  (or,  more  precisely,  nutritional)  status 
which  can  be  measured  without  any  ambiguity:  height  and  weight.  Contrasting  measurements  with  self- 
reports  for  the  same  person,  we  find  that  self-reports  are  subject  to  systematic  biases.  For  example,  men 
are  inclined  to  report  themselves  as  being  significantly  taller  than  they  actually  are;  women  are  inclined 
to  report  themselves  as  significantly  lighter  than  they  are.  If  these  were  the  only  differences,  it  would  be 
simple  to  adjust  reports  so  they  map  into  measurements.  They  are  not.  There  is  evidence  that  people  are 
inclined  to  report  their  health  status  as  being  closer  to  an  "ideal"  than  it  is.  This  example  highlights  the 
complexity  of  deriving  adjustment  factors  to  offset  the  effects  of  reporting  biases. 
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We  proceed  to  examine  a  series  of  self-reported  health  indicators  collected  in  the  second  wave  of 
the  Indonesia  Family  Life  Survey  (IFLS2).  In  these  cases,  truth  is  not  known  and  so  we  turn  to  a 
comparison  of  the  relationships  between  health  status  and  socio-economic  status  (SES)  to  provide  some 
insights  into  the  meaning  of  these  measures.  Contrasting  the  link  between  SES  and  physical  assessments 
with  the  link  between  SES  and  self-reported  health,  we  suggest  that  the  more  general  the  health  indicator, 
the  more  difficult  it  is  to  disentangle  "true"  health  status  from  "reporting  effects."1  However,  an 
examination  of  the  relationships  between  self-reports  and  physical  assessments  suggests  that  self-reports, 
particularly  more  general  questions,  do  contain  a  good  deal  of  information  about  the  respondent’s  health. 

We  conclude  that,  given  the  state  of  knowledge  at  this  time,  it  would  be  prudent  to  invest  in  the 
collection  of  both  self-reported  health  indicators  and  physical  assessments  of  health  status  in  individual  and 
household  surveys.  Recent  technological  advances  in  health  measurement  render  it  feasible  to  conduct  a 
broad  array  of  assessments  in  face-to-face  surveys  at  relatively  low  costs.  The  benefits  are  likely  to  far 
outweigh  the  costs. 

2.  Self-reports  and  measured  health 

Methods  that  have  been  used  to  measure  health  in  social  surveys  span  a  wide  spectrum,  from  one 
or  two  direct  questions  asked  of  individuals  about  their  perception  of  their  overall  health  to  a 
comprehensive  and  thorough  battery  of  clinical  assessments  conducted  by  physicians  or  other  trained 
personnel  who  examine  each  respondent.  For  a  general  review  of  issues  associated  with  health  interview 
survey  implementation,  see  Kroeger  (1985)  and  Ross  and  Vaughan  (1986);  Fisher,  Pappas  and  Limb 
(1996)  provide  a  discussion  in  the  context  of  health  examination  surveys. 

The  vast  majority  of  surveys  in  both  developed  and  developing  countries  have  relied  on  individual 
responses  to  a  series  of  questions  about  their  health  posed  in  an  interview  setting.2  As  a  starting  point, 

%urg  (2000)  presents  related  evidence  from  Denmark  and  the  United  States  in  which  observed  health  is  contrasted 
with  both  self-assessments  and  physician-assessments. 

2The  content  and  quantity  of  those  questions  depends  on  the  number  of  different  dimensions  of  health  that  are  of 
interest  and,  within  each  dimension,  on  the  desired  sensitivity  to  different  levels  of  well-being.  Standard  health 
assessment  instruments  include,  for  example,  the  Sickness  Impact  Profile,  the  Quality  of  Well-being  Scale,  and  the 
Nottingham  Health  Profile.  A  well-known  study  (and  resulting  instrument)  is  the  RAND  Medical  Outcomes  Study 
(MOS).  The  full  version  of  the  MOS  questionnaire  contains  149  items  and  requires  an  estimated  30-40  minutes  to 
administer  (Stewart  and  Ware,  1992).  A  reduced  instrument,  SF-36,  has  been  widely  adopted  and  is  advocated  as 
a  good  basic  tool  for  multi-purpose  surveys.  For  a  good  discussion  of  issues  about  wording,  recall  periods  etc.,  see 
Murray  and  Chen  (1992). 
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let  self-reported  health,  0s,  indicate  underlying  health,  0*,  with  adjustments  that  are,  for  simplicity  of 
exposition,  assumed  to  be  additive  and  comprise  three  parts:  an  individual  idiosyncratic  component,  e,,  a 
factor  associated  with  the  specific  observable  indicator,  Ej,  and  a  factor  that  varies  with  both  the  individual 
and  the  indicator,  e^: 

0fj  =  0*  +  ei  +  ej  +  eij  m 

where  0*  is  multi-dimensional  and  hence  vector-valued.  As  a  convention,  we  will  treat  higher  levels  of 
0  as  denoting  better  health  status.  It  is  useful  to  provide  concrete  examples  for  the  three  types  of 
adjustments.  First,  consider  two  individuals  with  the  same  level  of  intrinsic  health,  0*;  if  one  is  more 
inclined  to  report  a  health  problem  than  the  other,  then  this  difference  will  be  reflected  in  e,.  Second,  say 
a  health  state  is  difficult  to  detect  because  it  is  asymptomatic;  self-reports  will  tend  to  be  noisy  indicators 
of  the  prevalence  of  the  state  as  reflected  in  a  relatively  large  variance  component,  Ej.  Third,  if  a  health 
state  is  better  known  among  some  people,  relative  to  others,  then  these  differences  will  be  reflected  in  the 
component  ey.  This  might  arise,  for  example,  if  we  were  to  focus  on  diagnosed  health  problems;  if  two 
people  have  the  same  level  of  intrinsic  health,  a  respondent  who  has  had  more  contact  with  health  services 
is  more  likely  to  report  a  diagnosis  relative  to  a  respondent  who  has  never  seen  a  trained  physician.3 

Two  self-reported  indicators,  0n  and  0i2,  reported  by  the  same  individual  will  share  the  common 
component,  e:,  which,  intuitively,  may  be  thought  of  as  the  propensity  to  report  oneself  as  ill.  As  a  result, 
the  two  will  be  positively  correlated  even  if  neither  is  related  to  0*.  Clearly,  "internal  validity"  of  self- 
reported  measures,  based  on  their  being  con-elated  with  one  another,  does  not  imply  they  are  valid 
indicators  of  0  .  In  practice,  E;  does  provides  information  about  the  respondent’s  perception  of  his  or  her 
own  health  which  may  be  an  important  dimension  of  health  status.  Recognizing  this,  we  refer  to  Ej  as  a 
"(self)-reporting  effect"  for  want  of  a  better  shorthand. 

There  are  relatively  few  examples  of  examination  surveys  that  have  been  conducted  on  a  large 
scale.  This  presumably  reflects  the  costs  and  complexity  of  fielding  these  sorts  of  surveys.  They  are, 
however,  a  very  rich  source  of  information.  One  of  the  best  examples,  perhaps,  is  the  United  States 
National  Health  and  Nutrition  Examination  Survey  (NHANES)  which  was  started  in  the  late  1950s  and 


3It  is  straightforward  to  allow  the  adjustments  to  depend  on  the  level  of  intrinsic  health,  in  which  case  the  effects 
become  multiplicative.  This  would  make  sense  if  respondents  are  inclined  to  report  their  own  health  as  close  to  the 
norm  (average  health)  so  that  the  adjustment  would  be  largest  for  those  in  poorest  (or  best)  health.  We  will  allow 
0  and  e  to  depend  on  a  common  characteristic,  x,  below  which  also  unties  the  additivity  assumption. 
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is  now  run  on  a  continuing  basis.  There  are  a  small  number  of  similar  surveys  that  have  been  successfully 
conducted  in  low  income  settings  which  suggest  that  examination  surveys  are  not  only  feasible  but  may 
also  be  desirable  under  certain  circumstances  (see,  for  example,  Fisher,  Pappas  and  Limb,  1996.) 

In  contrast  with  self-reports,  health  measurements  conducted  by  a  trained  nurse  or  physician  will 
not  be  affected  by  reporting  error4  and  for  those  measures  that  require  no  respondent  interaction  (such  as 
height)  there  may  be  no  individual-specific  measure-specific  errors  (^=0)/  Some  measures  do  involve 
respondent  interaction  (such  as  blowing  into  a  tube  to  measure  lung  capacity)  and,  like  self-reports,  they 
are  not  perfect  indicators  of  0!  (^0).  Thus,  in  general,  measured  health,  0M  is: 

e?i  =  e;  +  ^  [2] 

Since  interviews  are  a  good  deal  cheaper  to  administer  and  self-reports  have  been  included  in  a 
large  number  of  household  surveys,  it  would  be  very  convenient  if  the  relationships  between  0s,  0M  and 
intrinsic  health,  0*,  were  mapped  out. 

Several  very  important  studies  have  sought  to  validate  self-reports  against  clinical  measurements; 
typically  these  studies  have  contrasted  0^  with  0^  and  assessed  the  specificity  and  sensitivity  of  the  self- 
report.  These  comparisons  tend  to  be  made  with  specific  health  indicators  such  as  morbidities  associated 
with  particular  diseases.  It  is  usually  assumed  that  ^  and  ^  are  both  zero  so  that  the  difference  between 
self-reported  and  measured  health  is  attributed  to  the  reporting  effects,  e.  Typically,  self-reports  tend  to 
understate  the  incidence  of  a  health  problem  (Ee^O)  and  this  difference  is  greater  in  the  case  of  diseases 
that  are  asymptomatic  (ECjCO). 

Using  data  from  the  United  States,  for  example,  Krueger  (1957)  and  National  Center  for  Health 
Statistics  (1965)  indicate  there  is  a  wide  gap  between  respondent  perceptions  of  health  and  evaluations  by 
a  physician  and  conclude  that  self-reports  are  unlikely  to  serve  as  a  tool  for  the  measurement  of 
diagnosable  disease.  Belcher  et  al.  (1976)  provide  an  early  example  in  a  developing  country  setting. 
Respondent  evaluations  of  morbidities  were  compared  with  physician  reports  of  the  same  people  one  to 
four  days  later  in  a  study  conducted  in  Ghana.  They  treat  the  physician  assessment  as  truth  and  conclude 


4There  may  be  healthworker-specific  measurement  error  although  that  can  be  minimized  by  good  training,  field 
practice  and  supervision. 

5This  restriction  will  not  apply  if  the  amount  of  information  about  health  contained  in  any  measure  varies  with 
individual  characteristics. 
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there  is  a  tendency  for  self-reports  to  understate  the  incidence  of  health  problems  and  that  the  extent  of 
understatement  is  greater  the  less  readily  observed  the  ailment. 

There  are,  however,  several  problems  that  are  difficult  to  detect  in  a  clinic  setting  (such  as 
problems  that  emerge  intermittently)  in  which  case  ^  will  not  be  zero.  Moreover,  even  in  a  clinic,  many 
health  problems  involve  eliciting  information  from  the  respondent  (such  as  pain)  in  which  case  £y  may  not 
be  zero.  More  generally,  it  is  reasonable  to  argue  that  clinical  measures  are  not  perfect  and  that  in  some 
instances,  one's  perception  of  one’s  health  is  of  legitimate  concern  to  health  professionals  and  policy 
makers.  (See,  for  example,  Zurayk  et  al.,  1993a,  1993b,  1995,  for  an  illuminating  discussion  in  the  context 
of  reproductive  morbidity.)  In  general,  it  is  going  to  be  very  difficult  to  separately  distinguish  the 
components  e  and  %  in  [1]  and  [2]. 

To  illustrate  some  of  the  complexities,  we  begin  with  a  very  simple  case  in  which  0*  and  0M  are 
identical  and  so  \  is  zero;  we  consider  height  and  weight  of  adults  which  are  considered  to  be  indicators 
of  nutritional  status.  (Since  our  attention  is  focussed  on  measurement,  we  put  aside  the  question  of 
whether  anthropometries  are  indicative  of  health  status.)  The  difference,  [l]-[2], 

e?j  -  e*  =  +  *j  +  h  P] 

is  reported  in  Figure  1  using  measures  and  self-reports  for  the  same  individuals  age  20  through  90  in  the 
third  round  of  NHANES  (conducted  between  1988  and  1994).6  For  prime  age  adults,  height  is  fixed  and 
so  any  difference  between  reported  and  measured  height  must  be  due  to  reporting  effects.  As  seen  in  the 
upper  panel  of  the  figure,  men  tend  to  overstate  their  height  —  by  around  1cm  until  age  50  when  the 
overstatement  increases  with  age.  Apparently,  as  men  shrink  with  age,  they  do  not  update  their  height. 
Women  also  overstate  their  height  but  the  extent  of  overstatement  is  small  (and  not  significant)  until  they 


reach  age  50. 


The  difference  between  reported  and  measured  weight  is  presented  in  the  lower  panel.  Weight 
does  vary  through  the  day  and  between  days  which  will  affect  the  variance  of  the  deviation;  it  should  not, 
however,  affect  the  first  moment  of  the  difference  which  we  would  expect  to  be  zero.  It  is  not.  Whereas, 
on  average,  men  tend  to  overstate  their  weight  by  nearly  two-thirds  of  a  kilogram,  women  tend  to 
understate  weight  by  nearly  1 .5  kgs.  (The  standard  errors  are  0.07  kg  in  each  case  and  so  the  differences 


6The  figures  relate  the  gap  between  reported  and  measured  anthropometry  with  respondent  age.  The  non-parametric 
locally  weighted  smoothed  scatterplots  (Cleveland,  1980)  use  a  tricube  weighting  function  and  20%  bandwidth. 
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are  significant.)  Weight  under-reporting  is  greatest  among  younger  women,  declines  with  age  and,  among 
women  in  their  80s,  reported  weight  exceeds  measured  weight. 

In  principle,  if  it  is  known  that  men  tend  to  report  themselves  as  taller  and  heavier  than  they  are 
and  that  women  report  themselves  as  lighter  than  they  are,  it  would  be  possible  to  adjust  reported  height 
and  weight  (taking  account  of  differences  by  age  group)  and  not  incur  the  costs  of  conducting 
anthropometric  measures.  That  conclusion  would  be  premature. 

Table  1  reports  the  correlates  associated  with  the  difference  between  reported  anthropometry  and 
measured  anthropometry  (0-j  -  0^).  The  age  splines  essentially  replicate  the  shapes  in  Figure  1.  In 
household  surveys,  it  is  often  argued  that  better  educated  respondents  provide  more  accurate  answers;  in 
the  health  measurement  literature  this  might  be  attributed  to  the  better  educated  having  a  more  complete 
set  of  information.  However,  we  see  that  better  educated  men  tend  to  over-report  height  more  than  those 
who  are  less  educated.  Better  educated  women,  on  the  other  hand,  tend  to  report  heights  that  are  closer 
to  the  truth.  For  weight,  the  reverse  is  true.  Better  educated  men  tend  to  report  weights  that  are  closer 
to  the  truth  whereas  better  educated  women  tend  to  understate  their  weight  more  than  those  with  less 
schooling.  Apparently  the  link  between  education  and  accuracy  of  reporting  is  not  simple. 

To  probe  more  deeply,  the  regressions  also  include  controls  for  whether  the  respondent  feels  he 
or  she  is  underweight  or  overweight.  They  are  powerful  predictors  of  the  difference  between  reported  and 
measured  weight:  those  who  feel  they  are  overweight  tend  to  understate  their  weight  whereas  those  who 
feel  they  need  to  put  on  weight  tend  to  overstate  their  weight.  The  same  result  emerges  if  we  replace  these 
controls  with  indicators  for  respondents  who  are  trying  to  gain  or  lose  weight.  (Hope  springs  eternal?) 

Apparently,  even  in  the  case  of  indicators  that  are  easily  verified,  such  as  height  and  weight,  the 
gap  between  perceptions  and  reality  is  large,  significant  and  depends  on  a  host  of  factors.  Putting  aside 
the  fact  that  older  respondents,  whose  height  declines  with  age,  do  not  appear  to  update  their  perception 
of  their  height,  one  is  tempted  in  this  example  to  conclude  that  self  reports  reflect  not  only  0"  but  also  the 
respondent's  own  perception  of  some  ideal  health  status.  Men  want  to  be  taller  and  as  education  increases 
their  ideal  height  increases;  women  want  to  be  thinner  and  as  their  education  increases  their  ideal  weight 
decreases.  And  those  whose  weight  deviates  from  their  ideal  want  to  move  towards  that  ideal. 

Using  the  same  source  of  data  but  examining  child  height,  Strauss  and  Thomas  (1996)  observe  that 
the  gap  between  maternal  reports  and  measurements  tend  to  be  smaller  among  higher  income  and  better 
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educated  mothers  and  the  gaps  are  smaller  among  older  children.  They  suggest  this  likely  reflects 
differences  in  the  frequency  that  children  are  measured  and  would  arise  if  mothers  of  higher  socio¬ 
economic  status  (SES)  measure  their  children  (or  visit  health  centers  that  measure  their  children)  more 
frequently  than  women  of  lower  SES.  The  declining  gap  across  the  age  distribution  is  consistent  with  this 
interpretation  since  the  rate  of  growth  in  height  declines  with  age. 

Taken  together,  these  results  suggest  mapping  the  association  between  self  reports  of  health  and 
"true"  health  will  be  difficult  —  even  in  cases  in  which  there  is  no  argument  about  the  meaning  of  "truth". 
Things  become  even  more  complicated  in  the  more  general  case  in  which  the  terms  %  in  [2]  are  not 
reasonably  assumed  to  be  zero. 

3.  Reported  health,  measured  health  and  respondent  characteristics 

The  majority  of  studies  that  cross- validate  reported  health  with  measures  of  health  have  contrasted 
specific  symptoms  or  diagnoses  reported  by  a  respondent  in  an  interview  with  clinical  assessments.  Many 
of  the  health  indicators  recorded  in  health  interviews  are,  however,  very  difficult  to  cross-validate  because 
there  is  no  obvious  clinical  counterpart.  This  is,  perhaps,  least  controversial  in  the  case  of  general  health 
indicators  which  are  often  collected  in  interview  surveys;  indeed,  many  multi-purpose  surveys  focus 
primarily  on  "global"  or  "summary"  health  status  indicators  precisely  because  the  number  of  items  required 
to  "fully"  characterize  health  status  is  large. 

The  most  commonly  collected  item  is  self-reported  general  health  status  (GHS).  One  of  the  most 
extensively  documented  relationships  to  emerge  in  the  literature  on  health  status  is  that  self-reported  GHS 
is  a  significant  predictor  of  subsequent  mortality.  (See  McCallum,  Shadbolt  and  Wang,  1994;  Idler  and 
Benyamini,  1997,  provide  a  recent  review.) 

It  is  not  obvious  what  accounts  for  the  strong  relationship  between  self-rated  health  and  subsequent 
mortality.  It  is  likely  that  self-reported  health  is  very  inclusive  with  respect  to  both  the  range  and  severity 
of  conditions  that  it  reflects.7  Respondents  may  incorporate  knowledge  of  family  history  (and  thus  genetic 


7Self-reported  health  may  pick  up  diseases  that  have  begun  to  affect  respondents'  health  but  that  have  not  yet  been 
diagnosed.  However,  one  of  the  studies  that  found  significant  impacts  of  GHS  on  subsequent  mortality  found  no 
effects  of  GHS  on  the  onset  of  chronic  diseases  (Pijls,  Feskins,  and  Kromhout,  1993). 
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health  endowment)  into  their  answer.8  Possibly  self-rated  health  is  associated  with  practices  and  resources 
that  affect  subsequent  health.  In-depth  interviews  suggest  that  the  presence  or  absence  of  particular  health 
problems  or  of  positive  and  negative  health  behaviors  shapes  the  responses  of  the  majority  of  answers  to 
questions  on  general  health  status.  Moreover,  this  work  suggests  that  factors  which  lead  to  a  rating  of 
good  health  differ  from  those  associated  with  a  rating  of  poor  health  (Krause  and  Jay,  1994).  Respondents 
in  "good"  health  reported  choosing  this  rating  on  the  basis  of  comparisons  with  other  people  far  more 
frequently  than  did  respondents  who  rated  themselves  in  fair  or  poor  health.  All  of  the  respondents  in  poor 
health  cited  health  problems  or  difficulty  with  physical  functioning  (not  health  comparisons  or  health 
behaviors)  as  the  reason  for  their  choice  of  rating  (Krause  and  Jay,  1994).  Several  studies  confirm  these 
findings  and  the  results  indicate  that  it  would  be  incorrect  to  treat  "good"  and  "poor"  health  symmetrically 
(Smith,  Shelley,  and  Dennerstein,  1994). 

While  one  can  certainly  ask  a  physician  to  provide  an  assessment  of  an  individual’s  overall  health, 
0M,  unless  the  physician  has  a  complete  individual  and  family  history,  a  good  working  relationship  with 
the  respondent,  and  knows  a  considerable  amount  about  the  respondent’s  life  style,  0M  will  measure  "true" 
health,  0*,  with  error.  Given  the  model  outlined  above,  it  is  not  clear  what  to  make  of  comparisons 
between  0s  and  0M  since  we  know  nothing  about  £  and 

To  provide  an  illustration,  we  draw  on  data  from  the  second  wave  of  the  Indonesian  Family  Life 
Survey  (IFLS2).9  In  the  survey,  respondents  were  asked  a  long  battery  of  health  questions  that  included 


8This  is  not  the  entire  explanation.  Even  after  controlling  family  history,  several  studies  demonstrate  that  GHS 
continues  to  be  a  significant  predictor  of  mortality  (Pijls,  Feskins,  and  Kromhout,  1993;  Borawski,  Kinney,  and 
Kahana,  1996;  Deeg  et  al.,  1989). 

9The  IFLS  is  an  on-going  longitudinal  survey  of  individuals,  households,  families  and  communities  in  Indonesia. 
The  first  wave  was  conducted  in  the  last  half  of  1993  and  included  a  sample  of  7,224  households  in  321  communities 
in  13  of  the  archipelago's  provinces;  the  sample  is  representative  of  83%  of  the  population  of  Indonesia.  The  second 
wave  sought  to  re-interview  IFLS1  respondents  four  years  later  in  1997:  over  94%  of  the  IFLS1  households  were 
located  and  interviews  were  completed  with  more  than  30,000  respondents.  Enumerators  interviewed  all  household 
members  (apart  from  young  children  who  were  interviewed  by  proxy)  to  obtain  comprehensive  information  on 
consumption,  income  and  wealth  of  the  family,  labor  market  histories,  education  histories,  migration  histories, 
marriage  histories  and  fertility  histories  of  each  household  member.  While  the  survey  is  fundamentally  multi¬ 
purpose,  it  does  contain  a  good  deal  of  detail  on  health  status  and  health  behaviors.  Each  respondent  completed  a 
battery  of  questions  about  their  own  health,  use  of  health  care,  health  insurance,  smoking  and  the  health  of  non  co¬ 
resident  family  members.  In  addition,  a  trained  healthworker  visited  each  household  and  conducted  a  series  of 
physical  assessments  in  the  home.  IFLS2  was  coming  out  of  the  field  as  Indonesia  fell  into  the  biggest  economic 
and  financial  crisis  in  three  decades  and  so  we  returned  to  the  field  and  conducted  a  re-survey  of  25%  of  the 
enumeration  areas  included  in  IFLS.  That  survey,  IFLS2+,  was  completed  in  late  1998  and  the  next  wave,  IFLS3, 
was  completed  during  the  last  half  of  2000.  The  surveys  provide  a  unique  opportunity  to  assess  the  impact  of  the 
crisis  on  the  health  and  well-being  of  the  Indonesian  population. 
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their  current  general  health  status,  difficulties  with  a  series  of  Activities  of  Daily  Living  (ADLs),  questions 
on  specific  morbidities  (including  probes  to  obtain  details)  and  extensive  questions  about  use  of  health  care. 
In  addition,  a  trained  health  worker  (a  nurse  or  recently  qualified  doctor)  visited  the  household  separately 
and  conducted  a  series  of  physical  assessments.  These  measures  were  selected  to  represent  different 
dimensions  of  health  that  are  of  epidemiological  relevance  to  Indonesia  (and  most  developing  countries) 
while  also  being  feasible  to  field  in  a  household  survey  setting.  For  each  adult  respondent,  the  health 
worker  measured  height,  weight,  hemoglobin  status,  lung  capacity,  blood  pressure,  and  the  speed  with 
which  the  respondent  was  able  to  stand  up  five  times  from  a  sitting  position.10  Additionally,  at  the  end 
of  the  assessment  and  after  a  brief  discussion  with  the  respondent,  the  healthworker  rated  the  person’s 
health  relative  to  someone  of  average  health  of  the  same  age  and  sex  using  a  scale  of  1  (for  very  poor) 
through  9  (for  excellent). 

Figure  2  presents  the  joint  distribution  of  self-reported  GHS  and  healthworker  rating.  It  is 
comforting  that  the  mean  healthworker  score  declines  as  self-reports  move  from  good  to  average  and  then 
to  poor.  As  noted  above,  it  has  often  been  observed  that  the  distinction  between  "good"  and  "average"  is 
less  clear-cut  than  the  difference  between  "average"  and  "poor."  This  is  reflected  in  the  scores  assigned 
by  the  healthworkers.  The  marginal  distributions  of  the  healthworker  scores  within  each  GHS  category 
demonstrate  that  there  is  a  good  deal  of  heterogeneity  within  each  category.  There  has  to  be:  it  is  hard 
to  imagine  that  a  construct  as  complex  as  health  status  will  be  adequately  summarized  by  one  trichotomous 
indicator.  The  extent  of  overlap  in  the  marginal  distributions  across  the  GHS  categories  is,  however,  very 
substantial  and  is,  perhaps,  cause  for  pause.  While  it  is  likely  that  a  complete  physical  examination,  the 
results  of  a  battery  of  tests  and  a  family  history  would  reduce  the  variance  in  the  healthworker  scores,  the 
central  issue  will  likely  remain:  it  is  not  possible  to  separately  identify  e  and  %  when  comparisons  are  made 
between  self-reports  and  measures  of  health  when  neither  can  legitimately  be  construed  as  revealing  some 
intrinsic  level  of  health. 

We  turn,  therefore,  to  an  alternative  strategy  in  order  to  better  understand  the  nature  of  information 
contained  in  self-reported  and  measured  health  indicators.  Contrasting  the  correlations  between  respondent 


10Height  is  measured  with  Shorr  wooden  measuring  boards,  weight  with  SECA  scales  specially  designed  for  fielding 
anthropometry  surveys  conducted  by  UNICEF.  Blood  hemoglobin  levels  are  measured  using  the  Hemocue  portable 
photometer.  Lung  capacity  is  measured  with  Personal  Best  peak  flow  meters;  each  respondent  performs  the  test  three 
times  and  the  top  score  is  used  in  the  analyses  below.  Blood  pressure  is  measured  using  an  Omran  self-inflating 
meter  with  a  digital  readout. 
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characteristics  and  self-reported  health  with  the  correlations  between  the  same  characteristics  and  measured 
assessments  affords  an  opportunity  to  provide  a  characterization  of  how  the  meaning  of  health  varies  with 
those  characteristics.  For  concreteness,  consider  education  as  an  example.  The  correlation  between  it  and 
self-reported  health  will  reflect  at  least  two  components:  the  influence  of  education  on  underlying  intrinsic 
health  and  also  the  relationship  between  education  and  the  respondent's  perception  of  his  or  her  own 
health.  In  the  context  of  the  additive  model  above,  both  the  underlying  dimensions  of  health  and  the 
components  associated  with  reporting  effects  may  depend  on  socio-economic  characteristics,  x: 

0-j  =  0*j(x)  +  E  j(x)  +  8j  +  Ey(x)  [4] 

Education  will  be  associated  with  an  individual’s  reporting  propensity,  Ej,  if  better  educated  people  are 
better  informed  about  their  health  and  are,  therefore,  more  likely  to  identify  a  health  problem  (conditional 
on  a  particular  level  of  0  ).  If  what  is  construed  as  "normal"  (or  "difficult”)  varies  with  education,  then 
that  too  will  be  captured  in  e;(x). 

The  relationship  between  "reporting  effects"  and  education  is  likely  to  differ  depending  on  the 
particular  self-reported  indicators  and  so  Ey  is  allowed  to  be  a  function  of  x.  For  example,  a  person  with 
little  or  no  education  may  not  know  about  health  ailments  that  afflict  him  or  her  and  so  may  report  GHS 
as  very  good;  a  better  educated  person,  however,  with  the  same  level  of  0*  but  more  information  about 
it  may  perceive  himself  or  herself  to  not  be  in  very  good  health.  In  contrast,  whether  a  person  has 
difficulty  breathing  —  a  relatively  obvious  problem  -  may  be  less  prone  to  "reporting  effects"  of  this  form. 
In  general,  the  more  "objective”  the  indicator  —  or  the  more  objectively  defined  the  reference  categoiy  — 
and  the  more  obvious  the  health  problem,  the  less  likely  that  "reporting  effects"  will  vary  with  education 
or  any  other  element  of  the  vector  of  socio-economic  characteristics,  x.11 

In  contrast  to  self-reports,  correlations  between  socio-economic  characteristics  and  physical 
assessments  will  not  be  affected  by  individual  characteristics  unless  the  assessment  involves  participation 
of  the  respondent  (such  as  performing  the  puff  test)  and  that  participation  is  related  to  the  characteristic. 
Thus 

=  #Ij(x)  +  Cj  +  e8  [5] 


"See,  for  example,  Mackenbach  et  al.  (1996)  who  report  greater  under-reporting  of  chronic  conditions  by  those  with 
less  education  in  the  Netherlands. 
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and  so  differences  in  the  effects  of  x  on  elements  of  0s  and  0M  that  are  closely  related  to  the  same 
underlying  health  may  be  interpreted  as  being  informative  about  (dejdx  +  det/dx),  that  is  the  "reporting 
effects"  in  self-reports.  These  comparisons  will  be  contaminated  if  the  health  indicators  differ  in  their 
utility  as  measures  of  9*  which  argues  for  basing  inferences  on  multiple  health  status  indicators. 

Table  2  presents  results  of  the  correlations  between  a  series  of  health  indicators  and  respondent 
characteristics  for  adults  age  20  through  80  in  IFLS2.  In  addition  to  gender,  we  focus  on  four  respondent 
characteristics:  per  capita  expenditure  (PCE)  levels  in  the  household  which  is  a  measure  of  longer-run 
household  resources  and  can  be  thought  of  as  an  indicator  of  SES;  the  education  and  height  of  the 
respondent,  both  of  which  reflect  human  capital  investments  earlier  in  life  and,  finally,  age.  The  models 
include  splines  in  the  covariates  to  permit  non-linearities  in  their  effects. 

Table  2A  focuses  on  physical  assessments  conducted  by  the  health  worker.  The  first  two  reflect 
nutritional  status  of  the  respondent.  Body  mass  index  (BMI),  weight/height2,  is  thought  to  be  correlated 
with  physical  capacity  and  V02max,  and  extremes  of  BMI  have  been  shown  to  be  related  to  elevated 
morbidity  and  mortality.  For  both  males  and  females,  BMI  tends  to  increase  as  household  resources 
increase,  particularly  among  the  poorest,  and  as  education  increases,  at  least  for  those  with  less  than  6 
years  of  schooling.  (Among  better  educated  women,  BMI  declines  with  education  after  controlling  PCE.) 

Low  levels  of  hemoglobin  indicate  iron-deficiency  which  has  been  shown  to  be  linked  to 
susceptibility  to  disease,  fatigue  and  lower  levels  of  productivity.  Hemoglobin  levels  likely  reflect  the 
combination  of  a  diet  that  is  higher  in  animal  proteins  (a  primary  source  of  iron)  and  greater  absorption 
capacity  (which  is  reduced  by  disease  insults,  loss  of  blood  and  also  by  diets  that  are  high  in  rice).  As 
with  BMI,  better  educated  men  have  higher  levels  of  hemoglobin;  the  same  is  true  of  women  in  the  lower 
half  of  the  SES  spectrum.  Greater  attained  height  is  associated  with  elevated  hemoglobin  counts 
suggesting  that  the  effects  of  early  childhood  nutrition  experiences  may  reach  into  adulthood;  an  alternative 
interpretation  that  height  is  capturing  the  influence  of  SES  is  not  very  appealing  given  the  latter’s  relatively 
modest  effect  in  the  regression.  Hemoglobin  levels  of  men  tend  to  decline  with  age;  this  may  reflect  the 
impact  of  better  nutrition  among  more  recent  cohorts  or  the  cumulative  effect  of  disease  insults  among 
older  cohorts.  This  pattern  among  women  is  complicated  by  the  fact  that  reproductive  age  women  are 
prone  to  elevated  levels  of  anemia. 
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.Stature  and  lung  size  are  positively  correlated  and  so  height  is  a  very  good  predictor  of  lung 
capacity.  Lung  capacity  is  also  higher  among  better  educated  and  younger  males  and  females.  Although 
urban  dwellers  tend  to  be  taller,  and  have  higher  BMI,  there  is  no  difference  in  lung  capacity  between 
urban  and  rural  residents  which  presumably  reflects  lower  air  quality  in  many  urban  areas  in  Indonesia. 

The  time  it  takes  for  a  respondent  to  stand  from  a  sitting  position  is  intended  to  capture  muscular- 
skeletal  problems  and  primarily  reflects  lower  body  motor  functioning.  Each  respondent  was  asked  to 
repeat  the  test  five  times  and  the  total  time  taken  was  recorded  in  seconds.  Like  lung  capacity,  better 
educated  and  younger  respondents  perform  better  on  this  test. 

The  fifth  physical  measure  is  blood  pressure;  in  the  regressions,  we  estimate  the  probability  a 
respondent's  measured  blood  pressure  is  high  (>160,  >100).  For  both  men  and  women,  blood  pressure  is 
elevated  among  urban  residents  and  tends  to  rise  with  age  until  around  60.  Blood  pressure  also  increases 
with  education  until  completion  of  primary  school;  among  better  educated  males,  the  probability  of  high 
blood  pressure  is  constant  and  declines  with  education  among  women.  Since  elevated  blood  pressure  tends 
to  be  correlated  with  weight  (and  thus  rise  with  SES)  and  with  stress  (and  thus  possibly  rise  with  wages), 
it  is  plausible  that  the  relatively  modest  relationships  between  SES  and  blood  pressure  reflect  the  effect 
of  treatment  among  the  better  off  and  the  better  educated. 

Finally,  the  health  worker's  evaluation  of  each  respondent's  general  health  status  is  positively 
associated  with  PCE,  especially  among  the  poorest,  with  education  and  with  height.  There  is  little 
association  with  age  --  which  is  as  it  should  be  since  the  healthworker  was  instructed  to  compare  the 
respondent's  health  with  the  average  for  a  person  of  the  same  age. 

Of  these  measures,  only  BMI  and  hemoglobin  satisfy  the  condition  that  measurement  error,  is 
unlikely  to  be  related  to  respondent  characteristics.  For  the  others,  things  are  a  bit  more  complicated. 
Lung  capacity  and  sit-to-stand  times  require  interaction  between  the  healthworker  and  respondent.  In  the 
field,  not  all  respondents  approached  these  tests  the  same:  older,  higher  SES  women  were  sometimes  more 
tentative  than  other  respondents.  In  our  judgement,  the  impact  of  these  differences  is  relatively  modest. 
Health  interventions  that  moderate  blood  pressure  are  more  likely  among  higher  SES  respondents,  and  so 
we  are  cautious  in  our  interpretation  of  those  results.  While  the  healthworkers  were  instructed  to  only 
evaluate  the  health  of  the  respondent,  we  cannot  rule  out  the  possibility  that  the  healthworker  was 
influenced  by  the  respondent's  demeanor  and  attitude  which  is  likely  to  be  linked  to  SES. 
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With  this  in  mind,  there  are  several  general  observations  that  might  be  drawn  from  Table  2A. 
First,  measured  health  tends  to  be  positively  associated  with  SES  —  PCE  or  education  —  particularly  in 
the  case  of  those  indicators  that  do  not  involve  respondent  interaction.  Second,  except  in  those  cases  where 
there  are  reasons  to  expect  otherwise,  the  measured  indicators  suggest  that  health  status  declines  with  age. 

Table  2B  presents  comparable  regressions  drawing  on  some  of  the  self-reports  of  health  status 
collected  in  IFLS2.  Many  health  interview  surveys  include  questions  about  ADLs,  the  respondent's 
perception  of  his  or  her  ability  to  perform  physical  activities  such  as  walking  or  lifting  heavy  items,  or 
activities  that  are  necessary  in  daily  life  such  as  bathing  or  dressing,  Instrumental  Activities  of  Daily 
Living,  (I ADLs).  We  examine  two  of  the  questions  included  in  IFLS2  and  report,  in  the  first  columns  of 
the  table,  the  characteristics  associated  with  a  higher  probability  that  a  respondent  reports  having  difficulty 
walking  5  kms  (columns  1  and  2)  and  difficulty  carrying  a  heavy  load  (columns  3  and  4). 

It  has  been  argued  that  questions  about  ADLs  are  easy  for  respondents  to  understand  as  they  ask 
about  activities  that  are  well-defined  and  capture  (or  are  good  proxies  for)  important  dimensions  of 
functional  health  status.  Theoretical  work  has  related  ADLs  to  the  underlying  health  dimensions  they  are 
intended  to  tap  (Nagi,  1965;  Kopec,  1995;  Johnson  and  Wolinsky,  1993).  While  there  is  controversy  in 
the  literature  regarding  the  success  with  which  ADLs  do  (and  should)  match  objective  measures  of 
functioning  (Daltroy  et  al.,  1995,  Hoeymans  et  al„  1996),  if  predictive  power  of  future  health  problems  is 
a  metric  against  which  to  evaluate  health  indicators,  ADLS  appear  to  perform  fairly  well.  Studies  have 
found  that  scales  of  physical  functioning  based  on  ADLs  are  significant  predictors  of  subsequent  mortality, 
net  of  covariates  such  as  age  and  even  self-reported  GHS  (Reuben,  Siu,  and  Kimpau,  1992;  Scott  et  al., 
1997).  Nevertheless,  they  are  not  without  problems.  The  notions  "difficulty"  and  "heavy"  are  not 
explained  in  the  survey  and  depend  on  the  respondent’s  own  perception.  For  some  respondents,  the  ADLs 
are  outside  of  the  range  of  their  own  experience.  And,  for  many  of  the  ADLs  that  are  commonly  included 
in  surveys,  few  prime  age  respondents  report  difficulties  completing  the  activity.12  This  is  reflected  in 
the  regressions  in  Table  2B  which  indicate  that  the  incidence  of  difficulties  increases  substantially  with 
age  although  the  age  at  which  difficulties  emerge  is  lower  than  that  typically  observed  in  higher  income 
countries  (see  Strauss  et  al.,  1993,  for  a  detailed  discussion). 


12This  suggests  there  would  be  a  substantial  pay-off  to  the  development  of  ADL-type  questions  that  discriminate 
among  prime  age  adults. 
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The  striking  result  in  the  regressions  in  Table  2B  is  the  absence  of  a  negative  correlation  between 
SES  and  difficulty  with  either  ADL  (except  in  the  case  of  men  with  little  education  and  carrying  a  heavy 
load).  In  fact,  among  women  in  the  top  half  of  the  distribution  of  PCE,  a  higher  level  of  PCE  is  associated 
with  wore  difficulty  carrying  a  heavy  load.  The  same  is  not  true  of  walking  5  kms  nor  is  it  true  of  men; 
one  is  tempted  to  conclude  that  the  act  of  carrying  a  heavy  load  is  not  something  higher  income  women 
view  as  pail  of  their  daily  activities,  and  reporting  difficulties  is  more  a  reflection  of  perception  of  the 
appropriateness  of  the  activity  than  physical  constraints. 

Columns  5  through  8  report  the  relationship  between  GHS  and  respondent  characteristics.  The 
characteristics  associated  with  respondents  who  report  themselves  as  being  in  good  health  are  in  the  first 
two  columns;  the  characteristics  associated  with  being  in  poor  health  are  in  the  second  pair.  Two  issues 
cloud  interpretation  of  these  responses.  First,  it  is  unclear  what  reference  the  respondent  uses  as  the 
benchmark:  is  it  the  average  person  in  the  survey  (country),  in  the  community  or  someone  in  the 
respondent’s  peer  group?  While  this  can,  in  principle,  be  addressed  by  explicitly  specifying  the  reference 
group  in  the  interview,13  it  is  likely  that  a  respondent’s  notion  of  "good"  health  will  be  influenced  by  his 
or  her  own  experience  and  will  thus  vary  with  socio-economic  status,  interaction  with  the  health  system 
and  so  on. 

Results  for  physical  assessments  and  ADLs  indicate  that  health  status  tends  to  decline  with  age; 
the  age  profile  for  GHS  is  substantially  more  muted,  especially  in  the  case  of  respondents  who  report 
themselves  as  being  in  good  health.  If  all  respondents  use  the  average  in  the  country  as  the  reference,  we 
would  expect  a  decline  in  health  status  with  age  that  is  comparable  with  the  declines  observed  for  physical 
assessments  and  ADLs;  if  the  respondents  use  someone  of  the  same  age  as  the  reference,  there  would  be 
no  systematic  relationship  with  age.  The  estimated  age  profile  suggests  that  no  single  reference  is  used 
by  all  respondents  and  that  the  reference  health  amounts  to  some  combination  of  the  national  average  and 
the  respondent's  peer  group.  This  complicates  drawing  inferences  about  0'  with  these  sorts  of  data. 

Moreover,  even  among  those  who  report  themselves  in  poor  health,  there  are  marked  differences 
between  males  and  females.  A  significant  incidence  of  poor  health  emerges  much  earlier  among  females 
(during  their  thirties)  than  males  (during  their  fifties)  —  a  pattern  also  observed  for  carrying  a  heavy  load. 
Waldron  (1982)  and  others  have  written  extensively  on  this  topic  and  suggest  that  perceptions  of  health 


13See,  for  example,  the  Matlab  Health  and  Social  Survey  in  Bangladesh  and  pilot  studies  conducted  by  WHO. 
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(Ej)  differ  systematically  between  men  and  women.  Contrasting  the  age  profiles  in  Table  2B,  it  appears  that 
these  differences  vary  with  the  specific  health  measure  (i.e.,  in  context  of  the  model  above,  Ey  differs 
between  men  and  women). 

The  issue  of  the  reference  health  emerges  again  when  we  consider  the  relationship  between  GHS 
and  SES.  If  the  reference  is  the  national  average,  we  would  expect  GHS  to  improve  with  SES;  if  the 
reference  is  one’s  peer  group  (which  is  typically  comprised  of  people  with  similar  levels  of  SES),  there 
is  no  reason  to  expect  an  association.  We  find  no  association  between  SES  and  the  probability  that  a 
respondent  reports  himself  or  herself  as  being  in  good  health.  There  is,  however,  a  negative  association 
between  the  probability  of  reporting  poor  health  and  education,  but  only  among  those  who  have  completed 
primary  school.  If  these  patterns  reflect  differences  in  the  reference  health,  they  suggest  a  complex 
interaction  among  different  measures  of  SES,  levels  of  SES,  age  and  gender. 

The  lack  of  a  clearly  defined  reference  health  status  is  not  the  only  problem  with  interpreting  GHS. 
Rather,  the  observed  patterns  probably  also  reflect  differences  in  perceptions  of  what  is  "good"  or  "poor" 
health  as  well  as  differences  in  the  information  respondents  have  about  their  health. 

Dow  et  al.  (1999)  present  some  evidence  supporting  the  second  contention.  Drawing  on  data  from 
two  social  experiments,  one  conducted  in  the  United  States  and  one  conducted  in  Indonesia,  they  show  that 
as  the  price  of  health  care  is  increased,  there  is  a  decline  in  use  of  care.  However,  GHS  tends  to  improve 
where  prices  are  raised  and,  conversely,  GHS  worsens  where  prices  are  lowered.  If  one  were  to  interpret 
that  evidence  at  face  value,  one  would  conclude  that  raising  the  price  of  health  care  improves  health.  In 
fact,  when  physical  assessments  of  health  are  examined,  the  evidence  suggests  that  0  improved  when 
prices  were  reduced  and  respondents  saw  health  professionals  more  often.  Dow  et  al.  conclude  that  it  is 
not  seeing  the  doctor  that  makes  one  sick,  but  seeing  the  doctor  does  increase  one’s  information  set  and 
thereby  influences  one’s  perception  of  one’s  health.  Since  both  experiments  involved  randomization  of 
the  treatment,  the  analyses  are  not  contaminated  by  the  potential  for  reverse  causality  —  that  those  who  see 
the  doctor  more  often  are  more  likely  to  report  themselves  as  ill.  See  Newhouse  et  al.,  (1998)  for  a  fuller 
description  of  the  U.S.  experiment. 

The  last  two  health  indicators  in  Table  2B  are  the  number  of  days  the  respondent  reports  having 
had  to  limit  his  or  her  normal  activity  in  the  previous  four  weeks  and  the  number  of  days  spent  in  bed 
during  that  time.  Many  studies  have  argued  that  because  these  indicators  are  conceptually  very  clear  and 
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do  not  demand  judgements  of  the  respondent  about  the  meaning  of  "poor"  or  "difficulty,"  they  are  likely 
to  be  rather  good  measures  of  health.  We  are  more  sanguine. 

It  is  well  known  in  developed  countries  that  the  incidence  of  reporting  limited  activity  rises 
substantially  when  people  are  applying  for  or  are  receiving  public  assistance  that  is  linked  to  disability. 
This  is  less  likely  to  be  a  problem  in  most  developing  countries  where  such  assistance  is  not  common. 
It  has  also  been  suggested  that  people  who  are  unemployed  (or  who  have  retired)  are  more  likely  to  report 
limited  activity  as  a  rationalization  for  their  not  working. 

In  low  income  settings,  however,  it  is  not  obvious  that  "limited  activity"  is  well-defined. 
Moreover,  it  is  quite  likely  that  health  problems  that  limit  the  activity  of  a  laborer  are  quite  different  -  and 
possibly  more  serious  —  than  those  that  limit  the  activity  of  a  sedentary  worker.  The  meaning  of  "limited 
activity"  is  likely  to  vary  across  SES.  It  is  also  likely  to  vary  with  employment  status:  a  self-employed 
person  whose  income  is  directly  tied  to  his  or  her  being  actively  at  work  is  less  likely  to  have  limited 
normal  daily  activities  (failed  to  go  to  work)  than  someone  whose  income  includes  such  benefits  as  sick 
pay  (and  so  may  miss  a  day  of  work  without  incurring  an  income  penalty).  The  meaning  of  "limited 
activity"  is  likely,  once  again,  to  vary  systematically  with  SES. 

The  regression  results  support  this  view.  In  contrast  with  the  physical  assessments,  for  males,  there 
is  no  association  between  SES  and  either  the  number  of  days  of  limited  activity  or  the  number  of  bed  days. 
Among  females,  the  number  of  days  of  limited  activity  declines  with  education  but  increases  with  PCE 
(among  women  in  the  lower  half  of  the  PCE  distribution)  suggesting  that  among  these  women,  as  income 
increases,  they  are  more  likely  to  be  sick.  We  suspect  that  is  not  the  case  and  that,  rather,  they  are  more 
likely  to  report  their  activities  are  limited  by  health  problems. 

As  Sadana  (2000)  shows,  it  is  possible  to  construct  questions  about  specific  activities  which  are 
informative  about  health.  As  she  notes,  these  kinds  of  questions  need  to  be  tailored  to  the  particular 
context  and  sub-population  of  interest  which  limits  their  value  in  national  and  cross-national  studies. 

To  summarize,  the  evidence  in  Table  2B  suggests  that  it  is  difficult  to  interpret  self-reports  about 
the  kinds  of  general  indicators  of  health  status  that  are  commonly  collected  in  health  interviews.  The  more 
specific  the  question,  there  may  be  less  ambiguity  in  the  inteipretation  of  the  question  by  the  respondent, 
but,  unfortunately,  that  does  not  necessarily  translate  into  a  more  readily  interpreted  answer  by  the  analyst. 
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Table  2C  turns  to  specific  questions  about  morbidities.  As  discussed  above,  it  is  these  sorts  of 
questions  that  have  been  the  basis  for  the  majority  of  the  cross-validations  of  self-reports  and  clinical 
assessments  and  several  studies  have  described  their  strengths  and  weaknesses.  We  will  not,  therefore, 
dwell  on  them. 

In  IFLS2,  questions  were  asked  about  whether  the  respondent  had  experienced  a  particular 
morbidity,  reading  down  a  list  of  30  items.  If  so,  additional  questions  were  asked  about  the  nature  of  the 
morbidity  in  some  cases  —  such  as  the  type  of  coughing  —  to  elicit  more  information.  Six  morbidities  are 
included  in  Table  2C.  In  general,  the  better  educated  are  less  likely  to  report  the  presence  of  a  morbidity; 
there  is  little  association  with  PCE. 

The  contrast  between  coughing  and  breathing  difficulties  is  informative.  The  regressions  suggest 
higher  PCE  women  are  more  likely  to  report  they  suffer  from  coughing  problems.  They  are  not,  however, 
more  likely  to  report  having  breathing  difficulties  and,  recall,  they  do  not  tend  to  have  lower  levels  of  lung 
capacity.  One  suspects  that  the  positive  link  between  SES  and  reported  coughing  reflects,  in  part, 
differences  in  what  is  considered  to  be  a  "cough"  across  the  SES  distribution,  a  difference  that  does  not 
carry  through  to  the  more  general  term,  "breathing  difficulty."  A  similar  argument  might  be  made  for  the 
incidence  of  reported  "nausea"  which  rises  with  PCE  for  both  men  and  women  (among  those  below  median 
PCE).  Why  does  the  incidence  of  coughing  differ  across  the  SES  distribution?  It  is  possible  that  poorer 
women  are  not  aware  of  coughing  whereas  higher  income  women  are;  while  we  cannot  rule  that  out,  it 
seems  more  likely  to  us  that  poorer  women  consider  a  particular  amount  of  coughing  to  be  normal  whereas 
higher  income  women  report  that  same  amount  of  coughing  as  an  illness.  (See  Bhatia  and  Cleland,  1995.) 
It  is  clear  that  the  precise  nature  of  questions  about  morbidities  makes  a  difference.  Sindelar  and  Thomas 
(1991)  argue  that  evidence  from  Peru  suggests  relative  to  diagnoses  (which  often  involve  interaction  with 
the  health  system)  questions  about  symptoms  are  less  prone  to  differences  in  interpretation  across  the  SES 
distribution.  See  also  Zurayk  et  al.  (1995)  and  Murray  and  Chen  (1992). 

4.  Self  reports  and  physical  assessments 

If  self-reports  are  contaminated  by  e,  can  they  be  used  to  say  anything  about  0  ?  To  address  this 
question.  Tables  3  and  4  present  the  relationships  between  self-reports  and  physical  assessments. 
Substituting  [5]  in  [4]  we  estimate: 
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[6] 


0|j  =  Wj  +  ei(x)  +  Ej  +  Ei/x)  +  ej  +  eij  +  « jj 
to  provide  an  empirical  assessment  of  the  links  between  self-reports  of  health,  0s  and  physical  measures, 
0M.  If  they  are  unrelated,  then  P  will  be  zero  indicating  that  the  self-report  conveys  no  information  about 
any  of  the  physical  assessments. 

We  begin  with  GHS  in  Table  3. 14  The  physical  assessments  do  not  do  a  good  job  of 
discriminating  between  respondents  in  average  health  and  those  in  good  health.  Even  the  healthworker 
evaluation  is  unable  to  discriminate  between  these  states.  (In  fact,  those  who  take  longer  to  stand  from 
a  sitting  position  are  more  likely  to  report  themselves  in  better  health!)  In  contrast,  the  assessments,  0M, 
do  much  better  in  isolating  those  who  report  themselves  as  being  in  poor  health;  this  is  true  for  lung 
capacity,  blood  pressure,  hemoglobin  levels  and  BMI.  It  is  also  the  case  that  the  healthworker  evaluation 
has  predictive  power  that  is  independent  of  these  assessments.  (This  does  not  reflect  reverse  causality 
since,  recall,  the  healthworker  does  not  know  how  the  respondent  answered  the  questions  about  self- 
assessed  health.)  There  can  be  little  argument  that  poor  GHS  does  contain  information  about  0\ 

There  is,  however,  cause  for  caution.  In  the  model  [6],  SES  will  predict  0s  if  there  are  unmeasured 
aspects  of  health  that  are  correlated  with  SES.  In  that  case,  given  the  positive  correlation  between  SES 
and  physical  assessments,  we  would  expect  00s/5x  to  be  positive.  If,  conditional  on  measured  health,  there 
are  differences  in  perceptions  of  health  across  the  SES  distribution,  then  those  will  enter  through  e  in  [6]. 
Those  effects  are  likely  to  be  negative  as  higher  SES  is  likely  associated  with  a  greater  propensity  to  report 
poor  health.  Thus,  if  00s/0x  is  negative,  then  the  reporting  effects  must  dominate  the  role  of 
unobservables.  For  the  case  of  females  who  report  themselves  as  being  in  poor  health,  this  turns  out  to 
be  the  case:  among  those  below  median  PCE,  as  PCE  increases,  women  are  more  likely  to  report 
themselves  as  being  in  poor  health.  A  similar  pattern  emerges  among  women  with  5  years  or  less  of 
schooling,  although  the  reverse  is  true  among  the  better  educated.  We  conclude  that  for  GHS  the  links 
between  0s,  0M  and  e  are  complex  and  complicate  interpretation  of  self-reported  GHS. 

Table  4A  focuses  attention  on  the  two  ADLs.  The  first  regression  in  each  pair  includes  the 
physical  assessments.  They  all  tend  to  predict  difficulties  with  walking  and  carrying  a  heavy  load 
suggesting  that  the  ADLs  are  also  doing  a  good  job  of  capturing  some  dimensions  of  these  assessments. 


14The  regressions  include  controls  for  age  (spline)  and  location  of  residence. 


They  are  suppressed  from  the  tables. 
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(There  are,  however,  some  instances  in  which  the  relationships  are  not  as  one  might  expect  —  higher  lung 
capacity  is  associated  with  greater  difficulty  carrying  a  heavy  load  among  men.) 

The  second  regression  in  each  pair  adds  self-reported  GHS  to  the  list  of  covariates.  We  have  two 
motivations.  First,  we  wish  to  assess  the  extent  to  which  GHS  is  able  to  soak  up  the  predictive  power  of 
the  physical  assessments.  Given  that  all  the  assessments  are  continuous  indicators  and  GHS  is  simply  a 
trichotomous  variable,  the  fact  that  significant  coefficients  decline  by  between  10  and  30  per  cent  indicates 
GHS  does  contain  information  about  0'  . 

Our  second  motivation  for  including  GHS  in  the  regression  is  to  examine  the  extent  to  which  it 
is  correlated  with  other  self-reported  health  measures  after  controlling  measured  health.  The  correlation 
is  high  and,  for  the  case  of  poor  health,  significant.  There  are  two  possible  interpretations.  Poor  GHS  is 
reflecting  unmeasured  aspects  of  health  which  are  also  reflected  in  the  ADLs.  Or,  we  are  simply  picking 
up  the  effect  of  individual  idiosyncratic  reporting  effects,  ci9  which  are  common  to  both  GHS  and  the 
ADLs.  It  is  not  possible  to  distinguish  between  these  hypotheses  with  these  data. 

Table  4B  presents  results  for  morbidities  and  activity  limitations;  we  report  only  those 
specifications  that  include  all  physical  assessments  and  GHS.  Respondents  with  low  lung  capacity  (and 
women  with  low  BMI)  are  more  prone  to  coughing  and  breathing  difficulty;  fever  likely  emanates  from 
a  number  of  sources  and  is  associated  with  most  of  the  physical  assessments;  neither  joint  pain  nor  nausea 
are  well-predicted  with  the  physical  assessments  included  in  IFLS2;  BMI  appears  to  predict  the  number 
of  days  of  limited  activity.  For  all  of  these  self-reports,  GHS  is  significantly  correlated  with  the  self- 
reports.  In  the  case  of  days  of  limited  activity,  the  inclusion  of  GHS  increases  the  regression  R2  by  a 
factor  of  four.  We  suspect  that  correlated  reporting  effects,  Cj,  is  the  driving  force  behind  that  observation. 

The  regressions  have  been  re-estimated  including  PCE  and  education  of  the  respondent  to 
determine  whether  reporting  differences  that  are  correlated  with  SES  persist  after  controlling  measured 
health  and  GHS.  The  results  are  reported  for  a  subset  of  the  self-reported  health  indicators  in  Table  5;  all 
regressions  also  include  the  full  set  of  covariates  listed  in  Table  4. 

Controlling  measured  health  and  GHS,  better  educated  men  are  more  likely  to  report  having 
difficulty  carrying  a  heavy  load.  As  PCE  increases,  women  in  households  with  PCE  above  the  median  are 
more  likely  to  report  having  difficulty  carrying  a  heavy  load.  It  would  seem  that  this  ADL  is  not  immune 
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to  differences  in  perceptions  of  what  is  meant  by  the  question  across  the  SES  distribution  for  both  men 
and  women. 

Women  in  households  with  PCE  below  the  median  are  more  likely  to  report  having  a  cough  and 
suffering  from  nausea  as  PCE  increases;  the  same  is  true  for  the  probability  a  male  reports  suffering  from 
nausea.  Increases  in  PCE  in  these  households  is  also  associated  with  a  greater  probability  a  woman  reports 
health  has  limited  her  activities. 

The  fact  that  SES  is  not  correlated  (or  is  negatively  correlated)  with  the  other  self-reported  health 
indicators  does  not  mean  that  those  indicators  are  not  contaminated  by  reporting  effects.  Rather,  the 
evidence  in  Table  5  indicates  that  the  propensity  to  report  health  problems  increases  with  SES  and  this 
propensity  dominates  the  positive  relationship  between  SES  and  improved  health  evident  in  Figure  2.  This 
provides  a  powerful  test  of  the  hypothesis  that  interpretation  of  self-reported  health  indicators  is  far  from 
straightforward. 

5.  Conclusions 

Using  data  from  NHANES3  and  IFLS2,  we  have  examined  the  links  between  self-reported  health 
status,  physical  assessments  of  health  and  SES.  The  empirical  results  highlight  the  complexity  of  the 
interactions  among  these  indicators  —  without  even  venturing  into  the  even  more  complex  arena  of  mental 
health,  pain  and  health  difficulties  that  are  very  difficult  to  diagnose. 

There  can  be  little  doubt  that  self-reports  of  health  obtained  in  health  interview  surveys  contain 
important  information  about  the  health  status  of  respondents.  There  is,  however,  evidence  that  the  self- 
reports  also  contain  information  about  the  respondents*  own  characteristics  —  their  education,  their  standard 
of  living,  their  interaction  with  the  health  system  and  their  beliefs  about  what  is  "good  health.” 
Disentangling  these  pieces  is  not  straightforward  (see,  also,  Sadana,  et  al,  2001). 

There  are  likely  to  be  very  substantial  pay-offs  to  investments  in  collecting  and  disseminating  more 
experimental  and  quasi-experimental  evidence  on  the  links  between  self-reports,  physician  assessments  and 
respondent  characteristics.  Studies  of  the  introduction  of  disability  income  for  workers  whose  health 
prevented  them  from  working  has  yielded  important  insights  into  the  way  self-reports  have  responded  to 
changes  in  economic  incentives  (Parsons,  1991).  There  are  a  large  number  of  other  potential  quasi¬ 
experiments  --  health  interventions,  introduction  of  new  or  improved  services,  information  campaigns, 
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changes  in  prices  of  care  or  changes  in  health  insurance  schemes.  By  exploiting  the  variation  induced  by 
such  (quasi-)exogenous  changes  in  the  environment  that  people  live  in,  and  by  contrasting  the  impact  of 
the  changes  on  perceptions  of  health  and  health  assessments,  it  should  be  possible  to  shed  some  light  on 
the  mechanisms  that  underlie  the  evolution  of  self-reported  health  status. 

Now  is  a  good  time  to  conduct  these  sorts  of  studies.  The  rate  of  technological  advances  in  the 
measurement  of  health  status  is  nothing  short  of  stunning,  and  it  is  becoming  increasing  feasible  to  conduct 
tests  for  many  common  ailments  in  a  field  setting  as  part  of  a  large-scale  household  survey  that  also 
contains  a  battery  of  self-reported  health  indicators,  information  about  respondent  behaviors  (such  as  use 
of  health  care,  employment,  daily  activities)  and  good  measures  of  socio-economic  status.  If  these  data 
are  collected  longitudinally,  it  will  be  possible  to  exploit  the  variation  that  arises  as  social,  economic  and 
demographic  environments  change  either  because  of  shifts  in  policy  or  because  of  external  forces.  There 
also  seems  to  be  considerable  scope  for  learning  about  the  meaning  of  self-reports  through  increased 
experimentation  in  questions  included  in  interview  surveys  that  contain  large  samples  of  respondents  who 
span  a  wide  spectrum  of  SES  and  behaviors. 

The  collection  of  survey  data  that  integrates  self-reports  of  health,  clinical  or  physical  assessments 
and  includes  substantial  heterogeneity  in  respondent  backgrounds  is  an  important  first  step.  As  those  data 
are  placed  in  the  public  domain  and  extensively  analyzed,  so  they  will  surely  serve  as  powerful  tools  for 
scientists  to  make  more  headway  in  understanding  the  meaning  of  health  status  indicators  collected  in 
social  surveys  and,  thereby,  reveal  information  about  the  health  of  a  population  that  can  be  interpreted  with 
confidence. 
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Figure  1:  Height  and  weight  of  adults 
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Table  1:  Difference  between  reported  anthropometry  and  measured  anthropometry 

United  States  NHANES  3 


Reported  height-  Reported  weight-  BMKbased  on  reports)- 

measured  height  measured  weight  BMKbased  on  measurements) 


Male 

Female 

Male 

Female 

Male 

Female 

Age  20-40  (spline) 

0.002 

-0.009 

0.024 

0.027 

0.009 

0.011 

[0.17] 

[0.91] 

[2.1] 

[l.%] 

[1.97] 

[1.71] 

Age  40-60 

0.035 

0.025 

-0.01 1 

0.01 

-0.016 

-0.005 

[3.56] 

[2.39] 

[1.24] 

[0.75] 

[3.69] 

[0.8] 

Age  60-80 

0.113 

0.152 

0.003 

0.032 

-0.03 

-0.033 

[9.85] 

[12.08] 

[1.86] 

[3.2] 

[6.24] 

[5.56] 

Yrs  education 

0.050 

-0.104 

-0.053 

-0.072 

-0.035 

0.013 

[2.75] 

[4.12] 

[2.95] 

[3.32] 

[4.18] 

[0.94] 

ln(per  capita  income) 

-0.085 

-0.109 

-0.057 

-0.02 

0.002 

0.031 

(1)  if  self-report 

[1.28] 

[1.56] 

[0.73] 

[0.23] 

[0.06] 

[0.7] 

as  overweight 

0.12 

0.047 

-1.541 

-1.43 

-0.627 

-0.645 

[1.14] 

[0.39] 

[12.07] 

[10.43] 

[11.74] 

[9.35] 

as  underweight 

0.048 

0.017 

1.253 

1.069 

0.471 

0.447 

[0.24] 

[0.05] 

[6.19] 

[5.47] 

[5.77] 

[4.25] 

Intercept 

0.92 

2.63 

1.413 

-0.737 

0.246 

-1.102 

[1-38] 

[4.21] 

[1.95] 

[0.93] 

[0.82] 

[2.84] 

F(all  covariates) 

44.51 

59.13 

40.58 

41.13 

50.21 

27.43 

[0.00] 

[0.00] 

[0.00] 

[0.00] 

[0.00] 

[0.00] 

R2 

0.096 

0.149 

0.064 

0.058 

0.085 

0.037 

Notes:  Sample  is  3,870  males  and  3,682  females  age  20  through  90  interviewed  and  measured  in  NHANES3  (1988-1994).  t 

statistics  in  parentheses  based 

on  Huber-type  variance-covariance  estimation  based  on  influence  function  method. 

Table  2A:  Physical  health  assessments  and  respondent  characteristics 
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Table  2B:  Self-reported  health  and  respondent  characteristics 
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estimation  based  on  influence  function  method  allowing  within-cluster  correlations. 


Table  2C:  Self-reported  morbidities  and  respondent  characteristics 
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estimation  based  on  influence  function  method  allowing  within-cluster  correlations. 


Table  3:  General  health  status,  physical  health  and  SES 


Male 

(1) 

GHS  good 

Female 
(2)  (1) 

(2) 

Male 

(1) 

GHS  poor 

Female 

(2)  (1) 

(2) 

ln(PCE)  below  median  (spline) 

0.588 

-0.268 

-0.96 

1.702 

[0.63] 

[0.34] 

[0.93] 

[2.02] 

--  above  median  (spline) 

-0.061 

0.265 

0.435 

-0.74 

[0.07] 

[0.36] 

[0.5] 

[0.95] 

Education  0-5  yrs  (spline) 

-0.008 

0.094 

-0.176 

0.478 

[0.03] 

[0.47] 

[0.55] 

[1-91] 

—  >-6  yrs  (spline) 

-0.095 

0.083 

-0.197 

-0.571 

[0.7] 

[0.57] 

[1.45] 

[3.72] 

Lung  capacity  <median  (spline) 

-0.006 

-0.008 

0.003 

0.002 

-0.098 

-0.096 

-0.021 

-0.017 

[0.68] 

[0.83] 

[0.4] 

[0.2] 

[4.99] 

[4.75] 

[2.05] 

[1.71] 

—  >median  (spline) 

0.011 

0.012 

0.009 

0.01 

0.000 

0 

-0.018 

-0.015 

[2.12] 

[2.19] 

[0.58] 

[0.62] 

[0.03] 

[0.03] 

[1.4] 

[1.16] 

Sit->stand  time  <median  (spline) 

0.903 

0.938 

1.268 

1.294 

0.559 

0.554 

0.763 

0.818 

[1.69] 

[1.72] 

[2.47] 

[2.48] 

[1.29] 

[1.25] 

[1.44] 

[1.54] 

—  >median  (spline) 

0.102 

0.119 

-0.165 

-0.136 

0.454 

0.494 

0.376 

0.305 

[0.55] 

[0.61] 

[0.99] 

[0.8] 

[1.4] 

[1.46] 

[1.47] 

[1.19] 

BP  diastolic  <150  (spline) 

0.078 

0.079 

-0.051 

-0.051 

-0.069 

-0.068 

-0.062 

-0.066 

[2.09] 

[2.09] 

[1.49] 

[1.44] 

[1-53] 

[1-47] 

[1.61] 

[1.68] 

--  >150  (spline) 

-0.08 

-0.078 

-0.024 

-0.022 

0.163 

0.166 

0.115 

0.112 

[2.09] 

[2.01] 

[0.91] 

[0.85] 

[2.52] 

[2.54] 

[2.41] 

[2.32] 

BP  systolic  <90  (spline) 

-0.025 

-0.035 

0.057 

0.051 

0.14 

0.143 

-0.077 

-0.079 

[0.46] 

[0.63] 

[1.17] 

[1.04] 

[2.51] 

[2.51] 

[1.26] 

[1.27] 

—  >90  (spline) 

0.074 

0.077 

-0.102 

-0.108 

-0.054 

-0.074 

0.175 

0.183 

[0.74] 

[0.75] 

[1.48] 

[1.55] 

[0.41] 

[0.55] 

[1.52] 

[1.58] 

Hemoglobin  <12  (spline) 

-0.632 

-0.552 

0.296 

0.312 

-1.435 

-1.199 

-1.618 

-1.606 

[1.07] 

[0.92] 

[0.94] 

[0.98] 

[1.58] 

[1.31] 

[2.95] 

[2.89] 

--  >12  (spline) 

0.634 

0.725 

0.65 

0.701 

-0.241 

-0.209 

0.724 

0.762 

[2.26] 

[2.52] 

[1.68] 

[1.8] 

[0.88] 

[0.76] 

[1.63] 

[1.7] 

BMI  <18  (spline) 

1.079 

1.012 

0.484 

0.501 

-4.233 

-4.428 

-3.363 

-3.67 

[1.63] 

[1.48] 

[1-09] 

[1.11] 

[3.29] 

[3.38] 

[3.63] 

[3.85] 

—  18-28  (spline) 

0.038 

0.055 

0.166 

0.156 

-0.411 

-0.359 

-0.137 

-0.182 

[0.24]  ' 

[0.32] 

[1.27] 

[1.15] 

[2.48] 

[2.16] 

[0.99] 

[1.3] 

—  >28  (spline) 

-0.391 

-0.347 

0.668 

0.673 

0.41 

0.584 

-0.227 

-0.286 

[0.42] 

[0.36] 

[1.28] 

[1.28] 

[0.45] 

[0.62] 

[0.48] 

[0.61] 

Height  (cms) 

0.022 

0.018 

0.005 

-0.006 

0.012 

0.024 

-0.004 

0.001 

[0.35] 

[0.28] 

[0.09] 

[0.11] 

[0.16] 

[0.33] 

[0.06] 

[0.01] 

HW  evaluation 

0.286 

0.286 

0.096 

0.088 

-1.601 

-1.429 

-1.58 

-1.551 

[0.51] 

[0.5] 

[0.23] 

[0.21] 

[3.58] 

[3.17] 

[3.2] 

[3.07] 

Age  20-30  (spline) 

-0.415 

-0.46 

0.195 

0.214 

0.085 

0.092 

0.14 

0.09 

[1-96] 

[2.14] 

[1.24] 

[1.33] 

[0.55] 

[0.6] 

[1.01] 

[0.65] 

Age  30-40  (spline) 

-0.127 

-0.142 

-0.196 

-0.187 

0.009 

-0.029 

0.417 

0.38 

[0.79] 

[0.88] 

[1.39] 

[1.26] 

[0.07] 

[0.21] 

[2.94] 

[2.66] 

Age  40-50  (spline) 

-0.036 

-0.021 

-0.213 

-0.216 

0.185 

0.165 

0.298 

0.337 

[0.26] 

[0.15] 

[1-49] 

[1.49] 

[1.1] 

[0.97] 

[1.68] 

[1.88] 

Age  50-60  (spline) 

-0.486 

-0.496 

-0.232 

-0.234 

0.514 

0.563 

0.124 

0.144 

[2.89] 

[2.9] 

[1-72] 

[1.67] 

[2.24] 

[2.39] 

[0.49] 

[0.55] 

Age  60-70  (spline) 

-0.043 

-0.058 

-0.129 

-0.118 

0.535 

0.431 

0.211 

0.281 

[0.26] 

[0.35] 

[0.88] 

[0.8] 

[1.46] 

[1.17] 

[0.57] 

[0.76] 

Age  70-80  (spline) 

0.054 

0.057 

0.217 

0.21 

-1.217 

-1.157 

1.775 

1.694 

[0.26] 

[0.27] 

[0.91] 

[0.87] 

[1.73] 

[1.63] 

[L98] 

[1.89] 

(1)  urban 

0.638 

0.658 

1.544 

1.471 

-2.008 

-1.325 

1.414 

1.81 

[0.64] 

[0.62] 

[1.86] 

[1.65] 

[2.18] 

[1.36] 

[E43] 

[1.7] 

F(all  covariates) 

12.12 

10.95 

14.27 

13.41 

12.87 

11.28 

10.43 

10.83 

[0.00] 

[0.00] 

[0.00] 

[0.00] 

[0.00] 

[0.00] 

[0.00] 

[0.00] 

R2 

0.092 

0.094 

0.062 

0.064 

0.092 

0.093 

0.067 

0.069 

Notes:  Sample  is  6,435  males  and  7,658  females  age  20  through  80  interviewed  and  measured  in  IFLS2  (1997).  t  statistics  in  parentheses 
based  on  Huber-type  variance-covariance  estimation  based  on  influence  function  method  allowing  within-cluster  correlations. 


Table  4A:  Self-reported  ADLs,  physical  health  and  GHS 


Walk  5kms 
with  difficulty 
Male  Female 


Carry  heavy  load 
with  difficulty 
Male  Female 


(1) 

(2) 

(1) 

(2) 

(1) 

(2) 

(1) 

(2) 

Lung  capacity  cmedian  (spline) 

-0.080 

-0.057 

-0.016 

-0.013 

-0.051 

-0.033 

-0.016 

-0.013 

[4.11] 

[2.94] 

[L24] 

[0.98] 

[2.82] 

[1.74] 

[1.52] 

[1.25] 

-  >median  (spline) 

0.005 

0.005 

0.003 

0.006 

0.009 

0.008 

0.013 

0.015 

[L02] 

[1.00] 

[0.16] 

[0.31] 

[2.44] 

[2.39] 

[0.92] 

[U2] 

Sit->stand  time  <median  (spline) 

0.6 

0.466 

1.077 

0.945 

0.56 

0.452 

0.345 

0.209 

[1.58] 

[1.29] 

[1.49] 

[1.31] 

[1.78] 

[1.54] 

[0.51] 

[0.32] 

-  >median  (spline) 

1.369 

1.263 

0.971 

0.907 

0.839 

0.753 

1.239 

1.189 

[4.05] 

[4.07] 

[3.46] 

[3.28] 

[2.44] 

[2.35] 

[4.52] 

[4.45] 

BP  diastolic  <150  (spline) 

-0.065 

-0.049 

-0.037 

-0.026 

-0.052 

-0.039 

-0.017 

-0.007 

[1-48] 

[1.2] 

[0.76] 

[0.54] 

[1.52] 

[1.19] 

[0.39] 

[0.15] 

--  >150  (spline) 

0.17 

0.132 

0.129 

0.109 

0.176 

0.145 

0.119 

0.103 

[2.6] 

[2.12] 

[2.35] 

[2.01] 

[3.03] 

[2.66] 

[2.35] 

[2.08] 

BP  systolic  <90  (spline) 

0.13 

0.097 

-0.006 

0.008 

0.067 

0.041 

-0.029 

-0.019 

[2.2] 

[1-71] 

[0.07] 

[0.1] 

[1.54] 

[0.94] 

[0.43] 

[0.29] 

-  >90  (spline) 

-0.042 

-0.03 

0.1 

0.071 

-0.075 

-0.065 

0.065 

0.042 

[0.31] 

[0.23] 

[0.77] 

[0.55] 

[0.67] 

[0.59] 

[0.55] 

[0.36] 

Hemoglobin  <12  (spline) 

-1.217 

-0.879 

0.236 

0.51 

-1.037 

-0.763 

-0.905 

-0.68 

[1.56] 

[1.25] 

[0.36] 

[0.79] 

[1.38] 

[1.09] 

[1.76] 

[1.37] 

—  >12  (spline) 

0.077 

0.131 

-1.175 

-1.299 

0.375 

0.419 

-0.795 

-0.913 

[0.3] 

[0.51] 

[1.94] 

[2.15] 

[1.85] 

[2.06] 

[1.65] 

[1.91] 

BMI  <18  (spline) 

-3.896 

-2.909 

-3.602 

-3.033 

-4.037 

-3.24 

-2.546 

-2.075 

[3.64] 

[2.81] 

[3.61] 

[3.03] 

[3.74] 

[3.28] 

[2.39] 

[1.96] 

-  18-28  (spline) 

-0.063 

0.034 

0.024 

0.047 

-0.137 

-0.059 

-0.32 

-0.304 

[0.37] 

[0.21] 

[0.12] 

[0.24] 

[1.00] 

[0.46] 

[2.00] 

[1.92] 

—  >28  (spline) 

1.067 

0.972 

2.02 

2.057 

0.34 

0.263 

2.169 

2.187 

[0.92] 

[0.87] 

[2.73] 

[2.79] 

[0.49] 

[0.38] 

[3.22] 

[3.22] 

Height  (cms) 

-0.094 

-0.097 

-0.177 

-0.176 

-0.107 

-0.109 

-0.232 

-0.231 

[1-51] 

[L61] 

[1-96] 

[1.96] 

[1.96] 

[2.04] 

[3.15] 

[3.17] 

HW  evaluation 

-0.166 

0.207 

0.953 

1.22 

0.003 

0.305 

0.512 

0.736 

[0.42] 

[0.53] 

[1.57] 

[2.02] 

[0.01] 

[0.9] 

[0.86] 

[1.25] 

( 1 )  self  report  GHS=poor 

0.234 

0.170 

0.190 

0.143 

[12.45] 

[9.77] 

[11.86] 

[10.01] 

(1)  self  report  GHS=good 

0.004 

0.002 

0.003 

0.021 

[0.38] 

[0.08] 

[0.46] 

[1-27] 

Age  20-30  (spline) 

-0.254 

-0.272 

-0.068 

-0.092 

-0.032 

-0.047 

-0.439 

-0.463 

[2.2] 

[2.42] 

[0.32] 

[0.44] 

[0.44] 

[0.62] 

[2.72] 

[2.87] 

Age  30-40  (spline) 

0.357 

0.356 

0.127 

0.057 

0.139 

0.138 

0.325 

0.27 

[2.85] 

[2.97] 

[0.61] 

[0.27] 

[1.79] 

[1.84] 

[2.02] 

[1.68] 

Age  40-50  (spline) 

0.086 

0.043 

1.388 

1.338 

0.014 

-0.021 

0.711 

0.673 

[0.54] 

[0.28] 

[5.52] 

[5.32] 

[0.13] 

[0.18] 

[3.49] 

[3.33] 

Age  50-60  (spline) 

1.818 

1.7 

1.734 

1.714 

1.192 

1.096 

1.783 

1.77 

[6.82] 

[6.6] 

[4.96] 

[4.98] 

[5] 

[4.82] 

[5.74] 

[5.85] 

Age  60-70  (spline) 

1.398 

1.273 

1.035 

0.999 

1.51 

1.409 

1.188 

1.161 

[3.49] 

[3.33] 

[2.51] 

[2.42] 

[3.99] 

[3.91] 

[2.76] 

[2.75] 

Age  70-80  (spline) 

2.085 

2.369 

1 .403 

1.102 

1.984 

2.214 

2.21 

1.951 

[3.13] 

[3.72] 

[2.09] 

[1.66] 

[2.81] 

[3.14] 

[2.88] 

[2.68] 

( 1 )  urban 

0.183 

0.651 

2.862 

2.619 

0.557 

0.935 

3.447 

3.212 

[0.22] 

[0.8] 

[2.07] 

[1.92] 

[0.81] 

[1.44] 

[2.97] 

[2.78] 

F(all  covariates) 

29.33 

36.28 

54.16 

61.73 

15.53 

19.15 

46.47 

50.47 

[0.00] 

[0.00] 

[0.00] 

[0.00] 

[0.00] 

[0.00] 

[0.00] 

[0.00] 

R2 

0.238 

0.281 

0.166 

0.179 

0.218 

0.264 

0.194 

0.207 

Notes:  Sample  is  6,435  males  and  7,658  females  age  20  through  80  interviewed  and  measured  in  IFLS2  (1997).  t  statistics  in  parentheses 
based  on  Huber-type  variance-covariance  estimation  based  on  influence  function  method  allowing  within-cluster  correlations. 


Table  5:  Self-reported  health,  physical  health  and  SES 
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